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It is very common for expressions of the form: 



7 , fii---i k (Xi 1 ,...,X ik ) 



to appear in probability theory. Here {X{\ is a sequence of independent random 
variables taking values in a measurable space (S, S), and {fi! -4 k } is a sequence of 
measurable functions from S k into a Banach Space (B, || • ||). Special cases of this 
type of random variable appear, for example, in statistics in the form of U-statistics 
and quadratic forms. Throughout we will refer to them as generalized U-statistics. 

There is great interest in decoupling such quantities, that is, in replacing the 
above quantity by the expression 



where {X^ 1 '}, {X^}, . . . , {X^} are k independent copies of {X;}. 

Decoupling inequalities allows one to compare expressions of the first kind with 
expressions of the second kind. Such results permit the almost-direct transfer of 
results for sums of independent random variables to the case of generalized U- 
statistics. The reason for this is that, conditionally on {x[ 2 ^}, .., {X^}, the second 
sum above is a sum of independent random variables. It is important to remark that 
such results have led to the development of several optimal results in the functional 
theory of U-statistics (cf. [1] and [7]) and various other areas, including the study 
of the invertibility of large matrices (cf. [2]), stochastic integration (cf. [10]), and 
the study of integral operators on Lebesgue-Bochner spaces (cf. a result of T. R. 
McConnell and D. Burkholder found in [3]). Aside from those directly cited in this 
paper, other important contributors to the area of decoupling inequalities include 
A. de Acosta, P. Hitczenko, J. Jacod, A. Jakubowski, O. Kallenberg, M. Klass, W. 
Krakowiak, G. Pisier, J. Rosinski, and J. Szulga. Due to space restrictions, we refer 
the reader to [10] for a more complete account. 
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In this paper, we announce a result which allows one to compare the tail probabil- 
ities of the above quantities. In particular, this inequality represents the definitive 
generalization of the decoupling inequalities for multilinear forms of McConnell and 
Taqqu [11] and the more general decoupling inequalities for expectations of convex 
functions of U-statistics introduced in [4] . 

Theorem 1. There is a constant C k > 0, depending only on k, such that for all 
n> k. 



(1) 



fi\...ik ( X ii ) • • • ) X ik) 

l<i 17 t---jki k <n 



> t 



< C k P c k 



^ fh -ik ( X h \ ■ • • ) X i' k ^ ) 



> t , for all t> 0. 



Moreover, the reverse inequality holds if the functions satisfy the condition 

fii...i k ( X h i • ■ • 7 X i k ) = ..i„ (Jb) P^Mi) ' • • • ' X i^(k) ) 

for all permutations n of {1, ... ,k}. That is, 
(2) 



P 



y ' fi\...ik ( X ii > • • • > X ik ) 

l<ii#---#ife<n 



> t 




/ j 3%\---%k ' ■ ■ ■ ' , 

l<il#---#ifc<« 



> t 



for all t>0. 



Note that the expression i\ ^ • • • =^ i k means that i r ^ i s for all 1 < r ^ s < k. 

An example illustrating this result can be found in the study of random graphs 
(see also [5]). Given a sequence of independent random points {Xi} in R N , we 
might consider a measure of clustering 

Di= d ( X i> X i)> 

l<i^j<n 

where d{x, y) denotes the distance between x and y. The above result allows us 
to compare D\, which measures the distance "within" the graph formed by the 
random cluster of points {Ai}, to a quantity D 2 , which is a measure of the distance 
"between" the two independent clusters {Xi} and {Xi}, 

D 2 = ]T d{Xi,Xi), 

where {Xi} is an independent copy of {Xi}. Then we have for alH > that 
C^PdD^ > C 2 t) < P{\D 2 \ >t)< C 2 P(\D 1 \ > C^H). 
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Other examples where U-statistics are used in graph theory may be found in [8] . 

We will prove the theorem in the special case that k = 2. For ease of notation, let 
us suppose that Xi = X[ 1] and denote X, t = xf\ The proof of the more general 
result will appear elsewhere. We will use a sequence of lemmas. Following [6], our 
point of departure is equation (4), which provides a partial decoupling result and 
focuses attention on a polarized version of the U-statistic kernel as the key element 
in the development of a solution of the problem at hand. Let 

(3) T n = ^ {fij (Xi, Xj) + fij(Xi, Xj) + fij(Xi, Xj) + fij(Xi, Xj)}; 

l<i^j<n 

then by using the triangle inequality, one obtains that 



(4) 



,Xj)+ fi 3 , Xj ) 



> t 



<P(\\Tn\\ >^)+ 2P 



fij( X i'Xj) 

l<ijtj<n 



> 



This observation reduces the proof of (1) to the problem of obtaining the bounds 



(5) 



and 



(6) 



fij(Xi,Xj) 



> t 



< cP 



fij i X i > X j ) + fij ( X i ' X j ) 

l<i^j<n 



> t 



P{\\T n \\>t)<cP c 



y fij( x i> x j) 

\<i^j<n 



> t 



We obtain (5) by means of Lemma 1 (possibly of independent interest). The proof 
of (6) is somewhat involved. In obtaining it, we used (conditionally) an extension of 
the Paley-Zygmund inequality found in [10] in combination with a symmetrization 
identity similar to the one introduced in [12]. 

Lemma 1. Let X, Y be two i.i.d. random variables. Then 



(7) 



It 

p(||x||>i)<3P(||x + y||>-). 



Proof. Let X, Y, and Z be i.i.d. random variables. Then 

^(11*11 >*) 

= P(\\(X + Y) + (X + Z) - (Y + Z) >2t) 

9t 9t It 

< P(\\X + Y\\ > -) + P(\\X + Z\\ > _)+P(||y + Z|| > -) 

= 3P(||X + F||>|). 
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It is to be remarked that the very desirable "Universal Symmctrization Lemma" , 
P(\\X\\ > t) < cP(c\\X - Y\\ > t), is not true. This makes the above result all the 
more surprising. 

The following is an observation found in Section 6.2 of [10] that will be used in 
combination with Lemma 2 to prove Theorem 1. 

Proposition 1. Let Y be any mean-zero random variable with values in a Ba- 
nach space {B,\\ ■ \\). Then, for all a E B, P(\\a + Y\\ > \\a\\) > j, where, 

K = mi x > eB > ■ 

As a consequence of the above we obtain the following lemma. 

Lemma 2. Let x,a i} bij all belong to a Banach space (B, \\ ■ ||), with bu = 0. Let 
{e{\ be a sequence of independent and symmetric Bernoulli random variables, that 
is, P(ei = 1) = P(ei = — 1) = i. Then, for a universal constant c > 0, 



i=l l<i^j<n 



> llxll 



>c-\ 



Proof. Suppose that ai, bij are in R; then it follows easily from (1.4) of [9] (see also 
Sections 6.2 and 6.5 of [10]) that 

1/2 



/ 


n 




4\ 4 


/ 


n 




2 \ 


E 






J" 


E 








V 


i=l 


l<ijij<n 




\ 


i=i 




J 



for some constant c > 0. Next, observe that ||£||4 < c||£||2 implies that |£|j 2 < 
c 2 ||£||i (since £(£) 2 < (£|£|) 2/3 ■ (E(0 4 ) 1/3 ) ■ The result then follows by Proposi- 
tion 1. 

Proof of Theorem 1. We first transform the problem of proving (6) into a prob- 
lem dealing (conditionally) with a non-homogeneous binomial in Bernoulli random 
variables. Let {et} be a sequence of independent and symmetric Bernoulli ran- 
dom variables independent of {Xi}, {Xi}. Let (Zi,Zi) = (Xi,Xi) if ej = 1 and 
(Zi,Zi) = {Xi,Xi) if a = -1. Then, 



(8) 



4/^, Z S ) = {(1 - + eMviXi, X,) + (1 + Ci )(l + eMviXuXj) 



+ (1 - ei){\ - eMuiXuXj) + (1 + e t )(l - e 3 )f^X u X,)}. 
Setting Q — a{Xi,Xi, i = 1, ...,n), we get 

(9) 4E{fij(Zi, Z 3 )\Q) = {f tj (Xi, Xi) + MX^Xj) + faiX^Xj) + /,,;.V,..V,ii. 
From Lemma 2, (3), (8), and (9), and letting x — T n , it follows that for some c > 0, 



P 4 



> \\Tn\\\g\ >c-\ 
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Integrating over the set {||T„|| > t}, we get 
1 



-P(\\T n \\>t) <P 4 



--P 4 



l<ijij<n 
l<i^j<n 



> t 



> t 



since the sequence {(Xi, X{), i = 1, n) has the same distribution as {(Z{, Zi),i = 
1, n}. The proof is completed by using this inequality along with (4) and (5). 

The proof of (2) is similar and uses an analogue of (8) concerning 4f(Z i7 Zj). 
In obtaining this bound, one does not need to use Lemma 1. Instead one uses the 
symmetry condition on the functions fij, introduced after (1) and equation (3), to 
get 



^ fij{ X i, X j 



>t\=P 



^ fij ( X i i X j ) + fij ( X i ! X j ) 

l<i=^j<n 



> 2t 



<P(\\T n \\>-t)+2P 



^ fij( X i> X j) 
l<i^j<n 
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